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We prove that the linear instability in a non-degenerate higher derivative theory, the Ostrogradski 
instability, cannot be removed by the addition of constraints. 
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I. INTRODUCTION 



When Newton wrote down his second Law of Motion 



F(q) 



(1) 



i.e. motion is described by an equation second order in 
the time derivative of the fundamental dynamical vari- 
able position q, he chose wisely. As it is now well known, 
almost two hundred years later, Ostrogradski [l| proved 
a theorem which showed that any non-degenerate theory 
whose fundamental dynamical variable is higher than 2nd 
order in time derivative there exist a linear instability. 

Consider instead, if Newton had chosen the fourth or- 
der theory 



(4) 



da 
dq 



(2) 



with, a being some function of q, i.e. a potential. This 
equation of motion can be obtained from a higher deriva- 
tive action of the following form 



a{q) 



(3) 



Since eq. ([2]) is 4th order, the phase space is 4 dimen- 
sional. Without going into too much details at the mo- 
ment - we will get there soon enough - we can describe 
the phase space by a pair of canonical variables and their 
momenta (P\,Q\) and {p2,Q2), with the Hamiltonian 



H 



P 2 



P1Q2 + -f + a(Qi). 



(4) 



One can always choose a{Qi) to be some function which 
is bounded from below, say a — Q\. More problem- 
atic however, is the first term which signals the famous 
Ostrogradski linear instability. The "linear" in "linear 
instability" refers to the linearity of the P\ in this term 
- since P\ is free to roam the phase space, there is no 
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barrier that prevents the Hamiltonian from becoming in- 
finitely negative. In other words, the Hamiltonian is not 
bounded from below [46| . 

This instability on its own is not a bad thing. It be- 
comes bad when interactions with other degrees of free- 
dom whose Hamiltonians are bounded from below are 
introduced. The presence of these negative energy states 
means that there exist a vast phase space where the 
Hamiltonian is negative, hence the modes will begin to 
populate them by entropic argument alone, and by con- 
servation of energy, creating an equally large number of 
positive energy modes in the interacting d.o.f. 0, H|. 
This is the onset of the instability. Note that, while this 
is a classical instability, in quantum theory, negative en- 
ergy modes are particularly sick - attempts to canon- 
ically quantize them will either lead to negative norm 
states (and hence undefined) or negative energy states 
(and hence runaway particle production). Since nega- 
tive norm states are often called "ghosts" in quantum 
theor y, h igher derivative theories are often called "ghost- 
like" g3. 

Recently, there is a resurgence of interest in higher 
derivative theories, particularly in attempts to modify 
gravity [4 15]. It is well known that higher derivative 
theories of the f(R) form is secretly healthy as it is 
degenerate - a technically important distinction which 
means that the highest derivative term cannot be writ- 
ten as a function of canonical variables. In fact, f(R) can 
be recast as an (interacting) theory of a scalar and two 
graviton modes |16l421| . On the other hand, theories of 
the form with curvature invariants R UV R^ V , R LLVa ^R^ v<J1 

are non- 



or the Weyl invariant C /w , . 7 C" i " 7 t 
degenerate higher derivative theories. Hence they suffer 
from the sickness of Ostrogradski. 

Furthermore, there is also great interest in the so- 
called "higher derivative" scalar field theories, such as the 
Galileon or Lovelock gravity [24H281 ] which when coupled 
non-trivially to the metric, results in interesting scalar 
field dynamics which cannot be reproduced by simple 
f(R) type modification. These theories, while naively 
look like "higher derivative" theories in the sense that in 
the Lagrangian there are present terms of 2nd order and 
higher in time derivatives, are secretly completely healthy 
non-higher-derivative theories; their E.O.M. are 2nd or- 
der in time derivatives and so are dimension 2 in phase 
space. They achieved this by the addition of structure 



iil, 



2 



in the Lagrangian - usually by the clever cancellation of 
higher derivative terms in the E.O.M. We do not con- 
sider this class of theories as higher derivative theories 
and they do not suffer from the instability. 

On the other hand, in true non-degenerate higher 
derivative theories, the instability is ubiquitous - as we 
will review below (also see [IJ), since Ostrogradski's the- 
orem is so simple to prove and requires very little initial 
assumptions, it is incredibly powerful [I. [29l433l | (See also 
48]). 

One possible way out is to impose boundary conditions 
or initial conditions such as the unstable modes are zero. 
For example, in [22j], the higher derivative terms in the 
Weyl term C 11VC , 1 C^" 1 is "removed" by imposition of 
such boundary conditions. However, this is not a sat- 
isfactory solution, as we explained above in the simple 
example with Newton's law of motion, the presence of 
any interaction will immediately source these modes and 
the instability will build up. 

Another interesting way out is suggested by Ben- 
der and Mannheim [34|, |35[ where they show that VT- 
symmetric Hamiltonians are ghost-free up to fourth or- 
der, although this require giving up Hermiticity of the 
Hamiltonian. In addition, since the classical theory in 
this case still possess negative energies, the Correspon- 
dence Principle is abandoned and the interpretation of 
this formalism is unclear. 

Finally, one might try to eliminate the instability by 
imposing constraints (for example, suggested by 3, 36, 
Ht})), i-e. one selectively restricts the trajectories of the 
d.o.f. such that the Hamiltonian becomes bounded from 
below. 

Imposition of constraints into theory requires the intro- 
duction of auxiliary variables and hence the enlargement 
of the total phase space (the dimensionality of the reduced 
phase space is still the same or smaller since trajectories 
are constrained). Hence, one may hope to change the 
orbits of the trajectories of the theory sufficiently to cure 
it of the instability. 

Using our fourth order theory example above, one can 
imagine a modification 



S = 



J dt Q«f - a(q) 



(5) 



where A is an auxiliary field which enforces the constraint 
f(liQ:U) = 0- We emphasize that the action eq. §5§ is 
a different physical theory from the action eq. ([3]) as 
long as the constraint cannot be gauged away. Can we 
cleverly choose / such that this theory, despite being a 
higher derivative theory, is free of the linear instability? 

In this paper we will prove that the answer is no, i.e. 
the instability in general cannot be removed by such im- 
position of constraints. Indeed, we will show that the 
instability persists even when the original dimensionality 
of the phase space is reduced by the imposition of con- 
straints - we will prove that the last surviving dynamical 
degree of freedom is unstable. Hence we demonstrate 



that an entire new class of non-degenerate higher deriva- 
tive theories also suffers from Ostrogradski's instability. 

The layout of the paper is as follows. In Section |TT] 
we review Ostrogradski's Theorem. In Section HID we 
prove our result for the simple case of a 2nd order (in 
the action) theory, and give some examples to illustrate 
the main points - the reader in a hurry to get to the 
gist may want to read this section only. We then show 
the general proof in Section IIVI In Section [V] we deal 
with the subtleties of the proof in the presence of gauge 
constraints. Finally, we summarize in Section [VII 



II. OSTROGRADSKI'S THEOREM: AN 
EXAMPLE 

Ostrogradski's theorem [l|,[2l| can be stated as follows: 



// the higher order time derivative Lagrangian 
is non- degenerate, there is at least one linear in- 
stability in the Hamiltonian of this system. 

Non-degeneracy means that the highest time derivative 
term can be expressed in terms of canonical variables. 
In the usual first order theory with a single degree of 
freedom q, this means that we can express the dynamical 
variable q as a function of canonical variables Q and P. 
For example, in a theory with L = (l/2)<j 2 - (l/2)g 2 , the 
canonical momentum is 



SL 



(0) 



which we can then trivially invert q = F(P,Q) = P. In 
a higher derivative theory, this translates to q( N ' can be 
express as a function of the canonical variables Qi and 
Pi. Degenerate theories, on the other hand, are non- 
invertible and are either stable on its own or may be 
made stable with the introduction of constraints [21| - 
we will not discuss such theories in this paper. 

A famous example of a higher derivative non- 
degenerate theory is the Pais-Uhlenbeck (PU) oscillator 
f38f . Here we follow the discussion in Ref . [3J, |35| . The 
PU action is given by 

S PU = J dtLpu = ^ J dt[q 2 - (w\ + wl)q 2 + w 2 w^q 2 } 

(7) 

where 7, wi, and w? are positive constants and without 
loss of generality we take w\ > u>2- The equation of 
motion of the Pais-Uhlenbeck oscillator is up to fourth 
order 



A 
dt 4 



hence requiring four initial value data (qo, <?o? Q0: Qq )> 



3 



allowing us to solve for q(t) to obtain 

9 9 (3) 

g(tj = j 2~ C0S ( w iU -r^2 sin(wit) 



W7 



Wi(w1 



Wo 



2 , ■■ 2 • , (3) 
— T COS(W 2 t) H p— g ^gr sln ( w 2U- 



u>5 



W2(wf - W%) 



(9) 



Since the solution depends on four initial value data, 
the phase space must be four dimensional, and Ostro- 
gradski's choice for the canonical coordinates is 



Qi = q 

Q2 = q 



Pi = 



SL 



PU 



Pi = 



8q 
SL PU 
Sq 



= -7(^1+^2)9 - ~q 

dL PU 



(3) 



dq 



iq- 



Non-degeneracy implies that q can be inverted and writ- 
ten as a function of the canonical variables Qi and Pi 
- here this is clearly the case. On the other hand, 
a degenerate model is always guaranteed to have con- 
straints. For example, if the model is degenerate, say if 
P-2 = SLpu/Sq is arbitrary function f(q,q) but not of q, 
then from the definition of P2 there will be a primary 
constraint P2 — f(Qi,Q2) — 0, which will reduce the 
physical degree of freedom and the final phase space will 
be smaller. 

The Hamiltonian of Pais-Uhlenbeck is as usual ob- 
tained by Legendre transforming 



H 



PU 



= Piq + P2q 
= P1Q2 1 



Lpu 

, 7 



27 



2 (wi+wl)Ql-±wiwlQl (10) 



The Hamiltonian generates the time evolution of any 
function of canonical variables F(Qi,Pi) via the Poisson 
Bracket F(Q;,P;) = [F{Q u Pi),H PU ]p, One can 
check that the evolution equations in this Hamiltonian 
formalism reproduce the Euler-Lagrange equation in the 
Lagrangian formalism, so it is the right Hamiltonian of 
the system. The Hamiltonian is conserved if the Pais- 
Uhlcnbeck Lagrangian is not explicitly dependent on 
t, thus we can as usual view the Hamiltonian as "energy" . 



As in Eq. ([TO]) , the Hamiltonian is linearly dependent 
on Pi and it means the system is unstable. The P1Q2 
term can be arbitrarily negative when P\ — > —00. The 
Hamiltonian is thus unbounded from below, which means 
that there is no well-defined vacuum state in the the- 
ory. Ostrogradski's result is that all the Hamiltonians 
of non-degenerate higher time derivative theory suffers 
from linear instabilities. 



into the theory, i.e. one cannot selectively constrain the 
unstable degrees of freedom and remove them from the 
physical phase space. We will introduce the constraints 
by auxiliary variables in such a way that there is no 
time derivative on Ai in the Lagrangian and primary con- 
straints are thus introduced through their canonical mo- 
menta, Pa ; = 0. Wc follow Dirac's method to analyze 
the higher order theory with constraints [39-43] . 

First we will show that the most general non- 
degenerate second time derivative Lagrangian with one 
extra auxiliary field (and hence a pair of second class con- 
straints) do not cure its instability. We then apply this 
result to the Pais-Uhlenbeck model. We generalize our 
result to any iV-th order non-degenerate higher derivative 
theory in Section HVl 



A. First vs Second Class Constraints 

We pause here to introduce the notion of first class and 
second class constraints. Second class constraints are 
"physical" in the sense that the solutions to the equa- 
tions of motion are different with or without the con- 
straint - e.g. a train restricted to move on a fixed rail- 
road which enforces the second class constraint. On the 
other hand, first class constraints are those associated 
with some gauge freedom in the theory, i.e. the solutions 
of the equations of motion contain some arbitrary func- 
tions of time and hence describe physically equivalent 
systems. 

As we know, one can "gauge fix" such theories - these 
so-called "gauge fixing" functions appear as new (pri- 
mary) constraints in the theory, and once introduced the 
original first class constraint and the new gauge fixing 
constraint both become second class constraints. Hence, 
when considering the instability, it is clear that once a 
general proof for second class constraints is shown, it is 
complete - physics do not depend on gauge choices after 
all. 

Having said that, there is a subtlety in when introduc- 
ing first class constraints which bears clearer elucidation: 
one is not free to choose any gauge fixing functions as 
they have to be chosen such that they only cross the 
gauge orbits once in the phase space. We will discuss 
this in section HV CI and prove it in general in Section fVl 



B. General Second Order Non-degenerate Theory 
with Second Class Constraint 



III. CONSTRAINTS DO NOT CURE 
OSTROGRADSKI'S INSTABILITY 

In this section, we will show that the Ostrogradski's in- 
stability in general cannot be cured by adding constraints 



The most general second time derivative Lagrangian 
with one auxiliary field A is given by the Lagrangian 



L = f(q,q,q,\). 



(11) 



4 



The equations of motion of this Lagrangian are 



dX 



df 

dq 



d_ 

dt 



dq 



dt 2 



df 



= 0, 



(12) 
(13) 



where the assumption of non-degeneracy means that the 
eq. (1131) is fourth order differential equation and we can 
solve q(t) and X(t) with six initial value data (qo, qo, qo, 

3q , Ao, Ao). The phase space is thus six-dimensional and 
following Ostrogradski's spirit, the choice of canonical 
variables is 



Qi = Q 
Q2 = q 
Q3 = A 



Pi 



P? 



P, 



SL _ _d_dj_ df_ 

Sq dt dq dq 

Sq dq 
5L 

Jx 



0, 



(14) 
(15) 
(16) 



where $i : P3 = is the primary constraint, here we in- 
troduce the notation ':' to denote "functional form given 
by" . The assumption of non-degeneracy allows us to use 
eq.(fT5"j) to invert q = h(Qi, Q2, Q3, Pa)- The total Hamil- 
tonian Ht of this system is 



H T = P1Q2 + 



P2h{Q 1 ,Q 2 ,Q 3 ,P2) 



f(Qi,Q 2 ,Qa,h) 
(17) 



where u\ is a function of canonical variables which can 
be found by later [49[, but since we are only interested 
in the stability of the physical degree of freedom on the 
reduced phase space, we will not write U\ explicitly. 

Since P3 — 0, consistency implies that its equation of 
motion P3 = [P^,Ht] must also vanish (on constraint) 
- this leads to a series of consistency relations which al- 
lows us to find further constraints called secondary con- 
straints. In this case, there exist one further secondary 
constraint as expected, which is 



$2 : [$i,H T ]p 



-Pi 

d£ 
dX 



dh dh 



dQ 3 dQ 3 



dq 



q—h 



d£ 
dX 



(Qi,Q2,Q 3 ,h) »0. 



(18) 



A=Q 3 



Here we introduce the weak equality symbol for the 
constraint equations. The constraint equation is written 
as $2 ~ 0, which means $2 is numerically restricted to 
be zero but does not identically vanish throughout phase 
space. I.e. $2 only vanishes on the hypersurface where 
all the constraints are satisfied. 

If $2 is dependent on Q3, then both $1 and $2 are 
second class constraints and there are no further con- 
straints from the consistency relations; further consis- 
tency relations only tell us the form of the arbitrary func- 
tion u\. Using the two second class constraints we can 
rewrite (Q3, P3) as functions of other canonical variables 



(Q3 ~ Q(Qi,Q 2 ,P2), P3 = 0). The reduced Hamiltonian 
Hr of the physical degree of freedom becomes 



H R = PiQ2 + P 2 h(Q 1 ,Q 2 ,Q(Qi, 
-f(Qi,Q 2 ,h,g). 



h,P2),P2 



(19) 



The reduced Hamiltonian is always linearly dependent 
on P\ for any conceivable Lagrangian L — f(q,q,q,X), 
which is the signal of instability. 

On the other hand, if $2 is not dependent on Q3, $1 
and $2 commute with each other. In this case, we can 
find further constraints from the consistency relations 
(and hence a reduction in the effective dimensionality 
of the phase space) and we should check whether the 
constraints are first or second class after we find all the 
constraints. We will give examples in the following sec- 
tions. 

Even in this simple example, one can quickly see why 
constraints cannot cure the instability. The instability's 
root cause is the pesky linear term P1Q2, and to fix this 
instability one must find a constraint where Q2 must be 
some function of Pi - but it is clear when generating the 
constraint $2 with the consistency relation eq. (IT8)) . Pi 
never enters the equation. A possible loophole is to sneak 
such a dependence in through clever choices of gauge fix- 
ing functions but as we mentioned previously, this also 
fail as we will demonstrate in Section IIVCI 



C. Pais-Uhlenbeck model with constraint 

We will now apply the above result to the Pais- 
Uhlcnbeck model as an example. We will consider two 
different constraints to present a flavor of how the insta- 
bility cannot evaded: q = q and q = q. In the former 
case, the dimensionality of the phase space remains the 
same (i.e. 4) with or without the constraint term, while 
in the latter case, the dimensionality of the phase space 
is reduced by 2. 



1. Constraint: q 







Firstly we consider the Pais-Uhlenbeck model with 
constraint q — q 2 = 0, with its Lagrangian given by 

Lpuc = ^[q 2 ~(w 2 i+w 2 2 )q 2 + w 2 iw 2 2 q 2 } + ^(q 2 ~q 2 ). (20) 

This model is an example where $2 is dependent on Q3. 

The equations of motion from varying with respect to 
A and q become differential equations of both variables 



q 2 - q 2 = 



(21) 



d^q , 2 

7^ +7K 



nr. 



'2, 

LL 

2) dP 



2,d 2 q 



dt 2 



+ -(Xq) = 0, (22) 



5 



and the functions q(t) and X(t) can be solve with four 
initial value data go, <?o, Ao, and Ao, thus the phase space 
of physical degree of freedom is four. Following the same 
procedure as we have done in last section, the choice of 
canonical variables is 



where A is the auxiliary field. This is an example where 
$2 is not dependent on Q3. The equations of motion 
from varying with respect to A and q are 



q 2 = 



(30) 



Qi = q 

Q2=q 
Q 3 = A 



Sq 





SL 


p 2 = 






~8j 




SL 


p 3 = 




5X 



(23) 
(24) 

(25) 



From eq. (|2"4")) . we can invert q = P2H1 + Q3), and the 
total Hamiltonian is 



H 



PUCT 



= P1Q2 + 



p2 
r 2 



1 



2(7 + 0a) + 2 [g3 + 7(w?+W22)]Q 



The primary constraint is $1 
one secondary constraint 



(26) 

P3 = 0, and there is only 



$ 2 



p2 



(7 + Q 3 ) 2 



P? 



(7 + Q3) 



±Q 2 «0. 



(27) 



Here the constraint algorithm bifurcates, and we will 
choose P 2 /( 7 + Qa) ~ Qi « over P 2 /( 7 + Q 3 ) + Q 2 « 
- one can check that choosing the other branch do not 
change the results [5(| ■ The constraints are both second 
class and we can use them to rewrite Q3, P3 as functions 
of other canonical variables. The reduced Hamiltonian 
of Pais-Uhlenbeck model with primary constraint q 2 — q 2 
is thus 

Hpucr = P1Q2 + P2Q2 + |K +w 2 2 - l)Ql - IwjwlQl 

This Hamiltonian remains linearly dependent on Pi and 
P2, hence still suffers from the instability. 



2. 







In the previous example, the dimensionality of the 
phase space remains four after the imposition of the con- 
straint. In this example, we will consider the case where 
the constraint reduces the dimensionality of the phase 
space to two. 

We consider the Lagrangian of Pais-Uhlenbeck model 



with a different constraint q 



0. 



jpuc 



r-2 1 2 1 2\ -2 2 2 2l 

■\q -{w 1 +w 2 )q +w 1 w 2 q \ 



'M 2 



(29) 



d A Q , 2 2\d 2 q , 2 
7-^j +7(^1 +w 2 ) -j^ + (jw 1 w- 



dt 2 



-A)g--(Ag) = 0. (31) 



Eq. (|30[) is a fairly powerful constraint - it means that 
once the initial value q(to) is specified q(to), q(to) and 
q( 3 \to) are all determined. We can solve q(t) and \(t) 
with two initial value data go and Ao, thus the number 
of coordinates on the reduced phase space is two and the 
physical degree of freedom is one. Following the same 
procedure as in the last section, the choice of canonical 
variables is 



Qi = q 

Q2=q 
Q 3 = A 



Sq 

+[A-7(tu? + to!)]g 
5L 

Pi = = 79 

Sq 
SL 

P 3 = -T- = 0, 



sx 



(32) 
(33) 

(34) 



and the total Hamiltonian is 
Lf 

27 



lh; < / = + ^ + l(wl + w\)Q\ - ^w 2 w 2 Q 2 



'■{Ql-QD + u^. 



(35) 



The primary constraint is given by eq. ([54]) $1 : P3 = 
and the secondary constraint obtained from the consis- 
tency relation is 



$2: Q 2 -Qi»0. 



(36) 



Here again, the constraint algorithm bifurcates, we have 
chosen Q 2 — Qi over Q 2 + Qi w 0. Since $ 2 is not 
dependent on Q3, the consistency algorithm tells us that 
there exist further constraints. Using the consistency re- 
lation $2 = [&2,Ht]p, we find an extra pair of second 
class constraints 



(37) 
(Q38) 



There is no further constraints and one can check that 
the set of constraints (<J>i, $2, $2, $3) are second class, 
and we recover the fact that the system has only a single 
degree of freedom with a reduced phase space of 2. The 
reduced Hamiltonian Hpjjcr can be obtained by using 
the four constraints to rewrite Q2, P2, Q3, P3 as functions 
ofQi.Pi 



$ 3 


-Q2^ 


p 2 

7 


$4 


p 2 


1 




7 


7 



Hpucr — PlQl 



iQl .1/2. 2\r\2 

— + 2^ + w 2)Qi 



1 



,2„,.2/ 



(39) 
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Again, the Hamiltonian is linearly dependent on Pi, and 
hence suffers from the instability. In particular, the extra 
strong constraint has reduced the effective dimensionality 
of the phase space from 4 to 2, and yet the remaining 
d.o.f. turns out to be unstable. This is a general result, 
as we will prove in Section llVI - the last surviving d.o.f. 
is always unstable. 

To complete our discussion, let us see how this exam- 
ple works in the Lagrangian picture. Namely, we want 
to check the equations of motion come from Hamilto- 
nian picture are equivalent to Euler-Lagrange equations. 
Following the usual Dirac procedure, we construct the 
matrix M a b = [$ a , $&]p, where 1 < a, b < 4. The equa- 
tion of motion for any arbitrary function of canonical 
variables F(Qi,Pj) is then given by the Dirac Bracket 
{F, Hpucr} 



F — {F, Hpucr} = 
[F, HpucrIp 



(40) 

[F^ a }pM ab [^H PUCR } P (41) 



where M ab is inverse of M b, i.e. M a t,M bc = S^. Here 
we should emphasize that no matter how we use the con- 
straints to rewrite the Hamiltonian, the equations of mo- 
tion will be equivalent on the constrained surface. 
The time evolution of Q%, 



Qi — {Qi, Hpucr} — Qi 



(42) 



reproduces the first Euler-Lagrange equation q — q = 0. 
(If we have chosen instead $2 : Qi + Qi ~ 0, then it 
will reproduce q + q = 0.) To check the second Euler- 
Lagrange equation, we use the Dirac Bracket to calculate 
the equations of motion for each canonical variable 



Q2 
Qs 
Pi 

Pi 



Qi, 

(Pi + Q1Q3 



Q'2 



(43) 
(44) 



-Pi + (-1 - wl + u>?(-l + w 2 2 )hQ u (45) 
7<9i, (46) 



and then the using the constraints eqs. (|3"3")l . ([4"3"]h and 
(|46p . reproduce the equations in the Lagrangian picture 
q = = q. Substituting the definition of Q\ and Pi 
into eq. (|45[) we recover the second equation of motion we 
found from the Euler-Lagrange equation eq. (I5T1) . 



IV. iV-TH ORDER THEORY WITH M 
AUXILIARY VARIABLES 

It is straightforward to generalize our result from the 
previous section to an iV-th order derivative theory (with 
N > 2) and M auxiliary variables. 

When we introduce constraints with M auxiliary vari- 
ables into an 7V-th order theory, it is clear that since the 
M variables are non-dynamical, they will not enlarge ef- 
fective dimensionality of the original unconstrained phase 
space which is 2N. We first consider the case where the 



number of constraints generated by M auxiliary variables 
is 2M in Section HV Al However, some auxiliary variables 
can reduce the effective dimensionality of the phase space, 
e.g. the case considered in Section fill C 21 In the latter 
case, the constraints introduced may be first class and re- 
quires special treatment - we will consider this in Section 

EvU 



A. M auxiliary variables with 2M constraints 

Consider the most general iV-th order theory with M 
auxiliary variables 



L 



N 



f(q, 9, q, ■ ■ ■ ) 9 (JV) , Ai , A 2 , . . ■ , Am)- 



(47) 



There are M + 1 Euler-Lagrange equations from vary- 
ing Ln with respect to A a and q 



df_ 

d\ a 



N 



sr- ( d Y d/ 
^ [ dt> da® 



0(a= 1,2,...,M) (48) 
0(i = 0, 1, 2, ... , N). (49) 



The total (unconstrained) phase space we start from is 
2 (TV + M) dimensional, and the canonical variables are 
chosen as follows 



N 



Qi = q 



Qn ^ 

Qn+i = Ai 

Qn+M = 



«-B-|>'-- 2£ 

3=1 



(50) 



N 



P, 



N 



Of 



P N+1 =P Xl =0 



(52) 
(53) 



P 



N+M 



Px 



0. 



(54) 



Non-degeneracy means we can solve for q 



(TV) 



as a function of Pn and Qi, i.e. 



h(Qi, . . . , Qn,Qn+i, • ■ • , Qn+M, Pn)- The total Hamil- 
tonian takes the form 

H T - PxQ2 + --- + Pn-iQn + PnKQi,...,Q n +m,Pn) 
-f{Qi,..-,QN+M,h) +u a ® a , (55) 

where $ a : Pjv+ a — are M primary constraints, with 
1 < a < M. We use the consistency relation to find the 
associated secondary constraints 



df 

$ a = [<S> a ,H T ]p : -f- 



0. 



(56) 



A a =C 
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If [$ a ,$f,]p 76 for 1 < a, b < M, both $ a and $ a 
are second class constraints and thus no further con- 
straints we can find using consistency relation - we will 
consider the case when there are further constraints in 
the next section. We can reduce M pairs of canoni- 
cal variables Qn+o,, PN+a by using the constraints, i.e. 
Qn+cl = F a (Qi, . . . , Q Nl P N ), P N+a = and the reduced 
Hamiltonian on the 2N phase space becomes 



H R = PxQ-i + ■■■ + Pn-iQn 
+P/v/i(Qi, . . . , Qn,Fh, ■ ■ ■ , Fmi 
—f{Qi, ■ ■ ■ j Qn, h, F a , . . . , Fm) 



Pn) 



(57) 



which is linearly dependent on Pi, ... , P/v-i and thus is 
necessarily unstable. Therefore we conclude that Ostro- 
gradski's instability survives. Since each auxiliary vari- 
able generates only a pair of constraints, the dimensional- 
ity of the reduced phase space is the same as the original 
theory without constraints 



Total 2(N + M) - 2M Constraints = 2N. 



(58) 



An example of this case is considered in Section llll C II 
above. 



by consistency relation [$ a ,7?T]p and [$q,, ffp]p are 
Of 



<S>a 



d\ a 







(60) 



A a =C 



$a : ™ Q (QAr +Q r°-V(Qi, • • . , Qp a +i) « 0. (61) 

Now <I Q are second class constraints, and they do not 
generate further constraints and as we have seen in last 
section, they can be used to reduce M — M' degrees of 
freedom (Q N +a, PN+a)- 

Meanwhile, for the <fr a term, we can see the constraint 
algorithm bifurcates - we can choose cither Q N+a ~ or 
g a (Qi, ■ • ■ , Qp a +i) to be the constraint <I> Q . The former 
choice gives us second class constraints and the theory is 
then identical to the previous section, i.e. we can use $ Q 
and $ Q to kill the M' degrees of freedom. 

On the other hand, if we choose the constraint <t a : 
9u{Qi, ■ ■ ■ , Q/3 a +i) ~ 0, there will be further constraints 
from consistency relation since [$ Q ,$ Q ]p ~ 0. Let us 
start from the secondary constraint $1, which depends 
on up to Qi3 ± +i 



#1 = [$ U H T ] : 



dgi 
dQ, 



Q 



t+i 



(62) 



B. M auxiliary variables with > 2M constraints 

If, on the other hand ['I'a, 'I'fcjp ~ 0, then there exist 
further constraints, and they may be first class - one 
cannot determine their class until all the constraints are 
found of course. If some of the constraints are first class, 
then we have to introduce gauge fixing functions. We 
will consider both cases in the following - but it is clear 
that the dimensionality of the reduced phase space will 
be <2N. 

The most general way to introduce the constraints such 
that [$ a ,$b]p ?» is the following rather complicated 
looking Lagrangian 

L = f(q, q, ? (jv) ,Am'+i,---, Am) 



M' 



(59) 



where n a ^ are arbitrary real numbers and /3 Q are 
positive integers where j3 a < N—l. We will use the index 
a and a to label the auxiliary variables, where 1 < a < 
M' and M' + 1 < a < M and without loss of generality, 
we assume /?i < fa < ■ ■ ■ < Pm>. We split the auxiliary 
variables A into those that explicitly couple to the iV- 
th derivative terms (in /) and those that couple only to 
[N — l)-th and smaller derivative terms. The reason why 
the latter is written in the above seemingly arbitrary way 
is to ensure that they generate further constraints as we 
will see below. 

The primary constraints are $ a : PN+a = and 
Q a : Pn+u — 0. The secondary constraints generated 



which is a function of (Qi, ■ ■ ■ Q^+i, Qp^+2)- We can 
see the consistency relation generates further constraint 
which is dependent on one more canonical variable 
coming from the last term in the sum above! We can 
keep generating further constraints if all the constraints 
commute with primary constraints as follows 

gt(Qi,...,Qp 1+ i) « 
9i(Qu- ■ ■ j Qpi+uQpi+z) ~ 



g[ N -^- 1 \Q 1 ,...,Q 01+1 ,...,Q N )^Q. 

The crucial point here is that all the constraints gener- 
ated by consistency relation (ea. (|IVB|) ) must reproduce 
®a ■ 9a{Q\, ■ ■ ■ , Qj3 a +\) ~ 0. In other words, all the con- 
straints we put in the Lagrangian need to be consistent 
and hence we are not free to choose the constraints in 
whichever way we like. 

The simplest way to achieve this is to only have a single 
auxiliary variable not coupled to the highest derivative 
term, i.e. A Q . In this case, all the constraints with higher 
derivatives are generated by consistency relation, we can 
thus guarantee the consistency of the theory. Although 
this simplest theory is consistent, there is no gauge con- 
straint since it turns out that all the constraints gener- 
ated from a single auxiliary variable are all second class. 
Indeed, this is the only way we can generate > 2M con- 
straints with M variables with all second class constraints 
(and hence is not gauged). 

Let us see how this works. Starting with the most gen- 
eral non-gauged iV-th order non-degenerate theory with 
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a single auxiliary field 

L = f(q,q,...,q^) + X n g(q,q,...,q^), (63) 

one can see this is a generalized theory of ea. (j29"]) . i.e. of 
the case when N = 2, M = 1, W = and $ = 1. The 
total Hamiltonian of this theory is 

#t = PiQ 2 + --- + Piv-i(9iv + J Pjv/i((9i,...,(3iv ) -Pjv) 
- /(Qi, ...,Q N ,h)- Q n N+1 g{Qi Qp+x) + m*i. 

(64) 

As usual the primary constraint is $i : Pjv+i = 
and secondary constraint $ 2 : ff(Qi, • ■ • , Qp+i) ~ and 
we can keep generating a tower of constraints with the 
consistency relation. First we go up the PjQj+i ladder 
in eq. (|63jl 

^ 3 (Ql,...,Qp + l,Qp +2 ) wO 

$ 4 (Ql,-..,Q/3+2,Q/3+3) WO 

$/v-/3+i(Qi,...,Q/v) «0 (65) 
$ w _^ +a (Qi,...,Q^,/i)»0 (66) 
=► $n-p+2(Qi,.-.,Qn,Pn) wfl (67) 

From eq. (|66[) to (|67[) we have explicitly written the 
constraints as functions of canonical variables ft. depends 
on (Qi, . . . Q/v, P/v) - notice that <£>/v-/3+2 has gained a 
P/v term. This allows us to turn around, and go down 
the PjQj+i ladder in the consistency relation 

$n-p+3(Qi.-.,Qn,Pn,Pn-i) wO 
®2N-2p+i{Qi ■ ■ ■ ,Qn,Pn,Pn-i ■ ■ ■ ,Pp+i) wO (68) 

$2/V-2£i+2(Ql • ■ • , Q/V, P/V ... , P/9+1, P/3, QiV+l) ~ 0. 

(69) 

From eq. (|55)) to we notice that there is a Q,a+i 

in the second last term in Ht so that brings in a depen- 
dence of Qn+i from the 3Ht I dQp+\ term in the Poisson 
Bracket. Since $2/v~2/3+2 depends on Qn+i, the next it- 
eration of the consistency relation $>2N-2p+2\p 76 0, 
there are no further constraints. We thus have 2N — 
2(3 + 2 second class constraints to reduce N — (3 + 1 
degrees of freedom ((Q/v+i, P/v+i), (Qp+i, Pp+i)), where 
i = 1,2, ... N — (3. After we impose all the constraints, 
we get our reduced Hamiltonian, which is still linearly 
dependent on Pi , . . . , Pp since the tower of constraints 
never did go below Pp 

H R = P X Q 2 + ■■■ + PpQp+i + F(Qi,Q 2 , • • ■ , Qp) + u 1 ® 1 . 

(70) 

The only stable Hamiltonian occurs when /3 = 0; in 
this case we can use the constraints to reduce N + 1 de- 
grees of freedom, but then the reduced theory is trivial. 
This is because j3 — means g( q , . . . q ^) — g( q ) , hence 



this constraint tells us q is a constant and all the deriva- 
tives on q vanishes. On the other hand, in the example 
of Section fill C 21 (3 = 1, and hence from our discussion 
here we can see that the phase space is constrained to di- 
mension 2. It is also clear that the last dynamical degree 
of freedom is unstable. 



C. Gauge constraint 

On the other hand, to have a gauge constraint we need 
to stop the consistency relation in the middle of the pro- 
cess. This is done by introducing one more auxiliary 
variable A a or X a to last example. Let us see how this is 
done. Consider the Lagrangian 

L = f( q , q ,..., gW a 2 ) + \?g(q, q, . . . , g («), (71) 

and from the definition of canonical momentum, we have 

P/v = =► q (N) = HQi, . . . , Q/v, P/v, Q/v+2). (72) 

One can see that h is now dependent on Q /v+2 ! Following 
the same process of last section, we first have primary 
constraints: $1 : P/v+i = and $2 : P/v+2 = 0, and 
secondary constraints: $2(<3i, • ■ • , Qp+i) ~ and $3 : 
9//9A2 ~ 0. Turning the consistency relation crank, we 
can get further constraints by climbing up the Pj—iQj 
ladder 

$3(<9l,---,Q/3 + l,<2/3+ 2 ) w 

§4,(Qi,...,Qf) + 2,Qf) + a) ~ 

*jv-/j+i(Qi,...,QjO«0 (73) 
$N-p+2{Qi,...,Q N ,h) wO (74) 
=> $N-P+2(Ql, • ■ ■ , Q/v, P/v, Q/v+2) « 0. (75) 

The presence of the Q/v+2 term in the last constraint 
prevents us from climbing down the ladder as the next 
iteration of the consistency relation fixes u instead of 
generating another relation. We thus have N — f3 + 4 
constraints, and among these constraints, there are four 
second class constraints $2> ^3, $N-p+i, $N-p+2 which 
can be used to reduce Q/v+2, P/v+2, Q/v, P/v- There are 
also N — f3 first class constraints $1, . . . , $/v-^- To fix 
the gauge completely, we need to introduce a number of 
gauge fixing functions equal to the number of first class 
constraints 

$i : Q N+1 = A(Q 1 ,...,Qp,P 1 ,...,Pp) 
& i+1 :Pp +i = B i (Q 1 ,,...,Qp,P 1 ,...,Pp) 

i = l,...,N - (3 -1. 

To ensure that our gauge fixing hypersurface intersects 
to each gauge orbit once and only once, we need to know 
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how does the gauge orbits look like in the phase space 
and choose the appropriate gauge fixing functions. If the 
constraints are simple, we can often guess appropriately 
- unfortunately this is rather hard especially if the gauge 
constraints are not linear functions of canonical variables. 
To prove our result, we have to rely on a formal definition 
of what it means by "intersects the gauge orbits once and 
only once" . 

The basic principle to have a healthy theory on the 
reduced phase space is that Dirac's equation is valid ev- 
erywhere on the reduced phase space. The Dirac's anal- 
ysis starts from defining matrix M a b = [$ a , <£>£,] p, where 
<J> a ,<I>b are all the first and second class constraints and 
the gauge fixing functions, and then defining the Dirac 
Bracket eq. (|4Tj). which we can then use to obtain the 
equations of motion. This requires that M a b is invertible 



det(M a b) ^ = constant, 



(76) 



where the requirement that it is constant stems from the 
fact that it has to be invertible everywhere on the reduced 
phase space hence cannot be a function of the canonical 
variables. 

We will now demonstrate this with a simple example to 
illustrate the physics, and then prove the result in general 
in Section IV] 



D. An example : Third Order Lagrangian with 
Gauge Constraints 

In this section, we will choose a simple model with 
gauge constraints to illustrate the point demonstrated 
in the previous section. The Lagrangian of the simplest 
model with gauge constraint takes the form 



L 



( g (3))S 



+ Xi(q-q) + X 2 (q - q), 



(77) 



where Ai, A2 are the auxiliary variables and we adopt 
the point of view that they are members of coordinates. 
Comparing this to cq. (|5T))) , we see that this is the case 
where M = 2, M' = 0, 

The equations of motion of varying Ai, A2, and q are 



q-q = 
q-q = 

+ X 1 +X 2 + X 1 



A 2 



0. 



(78) 



One can see Ai and A2 cannot be fully solved from the 
equations of motion; this is where the redundancy comes 
from. The canonical variables are chosen as follows 



Qi 

Q 5 



= q 

= q 

= q 

Ai 
A 2 



Pi 
P2 
P3 

Pi 
P 5 



-Ai + X 2 + q 



(5) 



-A 2 

q (3 \ 

Px 1 = 
Px 2 -- 



,(4) 



(79) 
(80) 
(81) 

(82) 
(83) 



where $1 : P4 — and $2 : P5 = are two primary 
constraints. The total Hamiltonian takes the form 



h t = P1Q2 + P2Q3 + ^ - Q4Qi - Q2) 

-Qs{Q\ - Q3) + + U2$2- 



(84) 



One can use the consistency relation to find all the 
independent secondary constraints 



$3 
$4 
$5 

$ 6 



Qi - Q2 « 0, 
Q2 - Q 3 « 0, 
Q 3 - Ps « 0, 
P3 + P2+Q5- 0. 



(85) 
(86) 
(87) 



where $1 is first class and there are naively five second 
class constraints $2 • ■ • $6- Since we know that second 
class constraints must come in pairs, one of the constraint 
must be secretly first class and this turns out to be $3 
- replace $3 -J> $3 - $ 2 : Qi ~ Q2 - P5 « 0, which is 
a first class constraint [5l|. We thus have two first class 
constraints $1, $3 and four second class constraints $2, 
$4, $5, and $ 6 . 

The first class constraints can be viewed as gauge 
transformation generators, where the transformations do 
not change the first order form action : = J dtL^ = 
J dtYl; PiQi — Ht- The variation of arbitrary functions 
of canonical variables under transformation generated by 
$1 are 



SQi = a(t)[Qi,P 4 }p = a(t)S u 
SPj = a(t)[P J ,P 4 ]p = 0. 



(89) 
(90) 



The variation of action becomes 



SS W = J dt P 4 a{t) + a(t)(Qi - Q 2 ) 
=> SS { V = J dt a{t){-P 4 + (Qi - Q 2 )) » 0, (91) 

where we have integrated by parts and used the equation 
of motion P 4 ps (Q 1 — Q 2 ) to get the last equality. In 
fact, both first and second class constraints can gener- 
ate transformations which leave first order form action 
invariant, but only first class constraints guarantee the 
transformations are still on the constrained hyper surf ace. 

Similarly, the transformations generated by $3 are 



SPi 
5P 2 
SQ 5 



miPiAQi- 

P(t)[P2:(Ql- 

0(i)[Q5,(Qi 



Q2 
Q2 



P 5 )]p 
P 5 )]p : 
-P 5 )]p 



- -m- 



(92) 
(93) 
(94) 
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The variation of action becomes 

-/3(i)Q 2 + P{t)Q 3 + /3(t)(Qi - Qs)] 
=> dS {1) = J dtp{t){-Q x + Q2 + P5 + Q1- Q2) 

« Jdt p(t) [-Q 2 + Qs + (Qi - Qs) + Qi - Q2] 
^ Jdt 2p{t){Q x - Q 2 ) » 0, (95) 

where we have integrated by part and used the equation 
of motion Qi « [Qi, Ht]p and Pi ks [Pi,Hx]p, and hence 
the action remains invariant as asserted. 

The canonical method to deal with the gauges is to in- 
troduce constraints which are hypersurfaces intersecting 
each gauge orbit once. Since the second class constraints 
relate some canonical variables with other, the most gen- 
eral gauge fixing functions we can choose are 

$7 -Qi « FiQuPx) (96) 
$ 8 : Q 5 « G{Qx,Px) « G 2 {P X ,P 2 ) (97) 

where the two different forms of (|9"T|) are related by the 
other constraints equations - we have written it in two 
different ways for different purposes below. We thus have 
eight second class constraints to reduce four degrees of 
freedom. We first use $1, $ 2 , $3, $4, $5, and <f> 6 to 
reduce Q2, Q3, P 2 , P3, Pi, and P5. The "semi- reduced" 
Hamiltonian Hsr before gauge fixing is 

H SR = P 1 Q 1 -Q±-Q 5 Q 1 . (98) 
I 



M ab = 



To have a well-defined Dirac bracket on the hypersur- 
face, M a b must be invertiblc everywhere on the reduced 
phase space, which means det M a f, ^ and not depend 
on any canonical variables, i.e. a constant. The determi- 



Naively, one might think we can choose G(Qi : P\) such 
that the reduced Hamiltonian of physical degree of free- 
dom is bounded from below, e.g. one may want to choose 
Q 5 w P 1 (—P 1 Q 1 + 1), and hence the reduced Hamilto- 
nian becomes Hr = P±Q\ — Qi/2, ridding ourselves of 
the linear Pi term. However, such a gauge fixing function 
is not allowed as we will see in what follows. 



The reason why not all gauge fixing functions are al- 
lowed is that some gauge fixing function might inter- 
sect some gauge orbits more than once. The gauge or- 
bits generated by <&i are lines parallel to Q4-axis and 
which can be perfectly gauge fixed by arbitrary func- 
tion Q4 F(Qi,Pi). On the other hand, the gauge 
orbits generated by $3 are lines paralleling to line I : 
P\ + P 2 = 0, P 2 + Q5 = 0. Arbitrary gauge fixing func- 
tion $ 8 : Qb ~ G 2 (P\,P 2 ) may intersect some gauge 
orbits more than once if G 2 (Pi,P 2 ) is not linearly de- 
pendent on Pi and P 2 - as we have seen in the example 
above. We thus require $s : Qb ~ 9iP\ + 52^2, where gi 
and g 2 are constant coefficients. 



Formally, what the above paragraph means is that if 
we want to have a healthy theory on the constrained hy- 
persurface, we need to ensure the Dirac Bracket and thus 
the time evolution equations are well-defined. Following 
Dirac's analysis of constrained system, we construct the 
matrix M a h, where 1 < a, b < 8, as follows 



nant of M a b is 

detM ob = ^ g( ^ Pl) -l)%0 (99) 

and thus the gauge fixing function $g is 

*a :Q5«ff(Qi) + cPi, where c^l. (100) 






dF 
dPi 
dF 
dPi 
dF 
dPi 
dF 
dPi 



OF dG 
9Qi 9Pi 



dF dG 
dP x 9Qi 















dF 
dPi 




dF 
dPi 




dF 
dPi 
-1 


dF 
dPi 







1 








-1 



































dG 
dPi 


dG 
dPi 


dG 
dPi 


-1 dG 
1 dPi 





dF_dG_ _ dF dG 
9Qi dPi 9Pi dQi 

-1 

dG 
dPi 
dG 
dPi 
dG 
dPi 

1 + dPi 
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Eqs. flHSJ) to ([55]). and eq. (flUO)) . and the requirement 
$8 : ~ 5i -Pi + .92P2, imply that g{Qi) is a linear func- 
tion of Qi and the final acceptable gauge fixing functions 
have the form 



$ 7 :Q 4 « P(Qi,Pi) 
$ 8 : Q 5 w dQi + c Pi 



(101) 
(102) 



where c = <?i/(l + g 2 ) 7^ 1 and d = -52/(1 + g 2 ). Sub- 
stituting all the constraints back to the Hamiltonian, we 
find that the reduced Hamiltonian of the system takes 
the form 



H R = (l- c)P 1 Q 1 - 



(1 + 2d) 



Qi- 



(103) 



Since c ^ 1, the reduced Hamiltonian is linearly depen- 
dent on Pi and we conclude that this theory remains 
linearly unstable. 



V. FIRST CLASS CONSTRAINTS DO NOT 
CURE OSTROGRADSKI INSTABILITY 

As we have mentioned, it is clear from physics argu- 
ment alone that first class constraints are gauge con- 
straints and hence fixing them would not cure the in- 
stability. Having said that, from the previous section we 
see that not all gauge fixing functions are allowed and 
hence there are subtleties. In this section, we will show 
that the requirement that the gauge fixing functions in- 
tersect the gauge orbits once and only once is sufficient 
to ensure that the instability remains. 

We will show the proof for the case of a N-th order non- 
degenerate higher derivative theory with two auxiliary 
variables - it is clear that the crucial point is the number 
of first class constraints and not the number of auxiliary 
variables and hence our proof is general. 

We want to prove the following statement: 

For any Lagrangian L = f(q, q, . . . , q( N \ A2) + 
Xig(q,q) with arbitrary N, N > 3, if we require the 
determinant of M a f, be not vanishing everywhere on the 
constrained hypersurface (i.e. detM a & = c, where c is a 
non-zero real constant.), the reduced Hamiltonian will be 
linearly dependent on canonical momentum Pi. M a i> is 
defined by 



M ab = [<f> a ,$b}p 



(104) 



and $ a includes all the first class constraints with their 
conjugate gauge fixing functions as well as all the second 
class constraints. 

We will prove this statement with two steps. First we 
demonstrate that the determinant of M ab is a product of 
det Mab and det M pcr , where A, B represent all the first 
class constraints and the gauge fixing functions and p, a 
represent all the second class constraints. We then show 
that detM pcr is always proportional to (dq 2 g(Qi, Q2)) 4 
and if we require that det M ab be a non- vanishing con- 
stant then g(q,q) = cig — g{q) - i.e. we are not free to 



choose any g in the Lagrangian. Without loss of gener- 
ality, we can choose c% = 1. 

The second step is using induction to prove the state- 
ment is true. 



A. First Step : det M ab = det Mab X det M pa 

We start with the Lagrangian 

L = f(q,q,...,q( N \\ 2 ) + \ n ig (q,q), (105) 

as usual, the canonical variables are defined by 
eqs. (JSDJ to (53]), where M = 2 and q N = 
h(Qi, . . . , Qn, Pn, Qn+2)- After using the consistency 
algorithm, there are N — 1 first class constraints, 

Pv+i = 0, 
0(Qi,Q 2 )»O, 



where Oi k = (J2i=i Qi+l^Qi) are operators with (3 < 
k < N — 1), one should notice the ordering of the op- 
erators matters. The conjugate gauge fixing functions 
are 



Qn+i-F(Qi,Pi)*0, 
P 2 -G 2 (Qi,Pi)«0, 

P fc -G fc (Qi,Pi) «0, 
and the four second class constraints are 
F N = O. 



(106) 
(107) 

(108) 



'i N ■ 
Gn = iPi N 

Pv+2 - 

df_ 

d\ 2 



I 



O il g(Q 1 ,Q 2 )^0, 
hdn N )F N ^0 



0. (109) 



We can write all the constraints as a vector with 2N + 2 
elements, 

$a - [P n +i,Qn+i-F(Qi,Pl), 
5(Qi,Q 2 ),P2-G 2 (Qi,Pi),..., 
O ih ... O tl g(Q 1 ,Q 2 ),P k - G k (Qx,Pi), 
...,F N ,G N ,P N+2 ,I]. (110) 

Now, since M ab is anti-symmetric, detM a b = [$ a , $b]p = 
(pf (ai, a 2 , . . . , a2jv+2)) 2 , where pf(...) is the pfaffian of 
anti-symmetric matrix (see Appendix IA"|) . From the de- 
composition of the pfaffian and the Poisson Bracket, the 
determinant explicitly is 
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pf(«l, ^2, • • • , 0.2N+2) — pf(«2iV+l, 0-2N+2, 0.2N-1, a 2N , 0,1, ■ ■ ■ , &2N -2) 

— pf(fl2iV+l> a2A r +2)pf(a27V-l, a2N, 0,1, . . . , a27V-2) + pf (a,2N+l, a2Jv)pf(«2JV-l 5 2 N+2, Ol,..., C12N 

= pf(a2N+l,a 2 N+2) pf(o2JV_l,02Jv)pf(ai)-- -,02JV-2) 



pf (a 2W -i, a 2fe )pf (a 2iv , ai, . . . , a 2 fe, . . . , a 2J v_ 2 ) - (2iV <^ 2iV + 2) 



fe=l 



(1 



We have used the fact that the pfaffian is anti-symmetric 
with respect to characteristics <Zj to get the first equal- 
ity. The second equality is due to the fact that the 
only non- vanishing Poisson brackets [Pn+2, &o\p are o = 
2N, 2N + 2. The third equality comes from the fact that 
pf(a2jv-i)02fc-i) vanishes for 1 < k < N + 1. The pfaf- 
fian can be further reduced if one notices that the term 
with summation vanishes, i.e. 

N-l 

pf (02N-1, 02k)l?t (02N, 0,%,..., a 2 k, • ■ • , a 2 N -2) 

fc=i 

= ± ^ y^pf(02jy-l, a2fc)pf(a2jV, «2fc) x 

fc=l i^fc 

pf (Ol, • • • , a 2 fc, . . . , 0,21, ■ ■ ■ , a 2 N-2) = 0. (112) 



The first equality comes from the fact that only non- 
vanishing Poisson bracket [Fjf(Qi, . . . , Qn), ®a] is a — 
2k, 1 < k < N + 1. This term vanishes because the 
number of a 2 fc_i is two more than the number of a 2 fc. 
When we express the last pfaffian as the basic elements 
pf(di, Oj), we are forced to have pf(a 2 j_i, a 2j -i) for each 
term in the summation, and since all the $2i-i are 
only dependent on Qi, ■ ■ ■ ,Qi, thus the Poisson bracket 
[$ 2 j_i, $2j-i] = 0. Hence eq. ([Hip becomes the final 
form 



pf(ai,a 2 , . . . ,a 2W + 2 ) 



pf(ai, 
pf(ai, 



■ , 02N-2) 



pf (a 2 N+i, a 2 Ar+ 2 )pf (a 2 jv-i, o, 2 n) ~ (2N o 2N + 2) 



. ,a 2 jv- 2 ) x (dQ N+2 GNdQ N F N dp N I — dQ N+2 IdQ N F N dp N GN) 



(113) 



where pf (ai, . . . , a 2 jy_ 2 ) is the pfaffian of all the first class 
constraints with their conjugate gauge fixing functions. 
From eq. (|113p . we notice that the pfaffian only see the 
term dependent on Qn in Fn and the term dependent 
on h in Gat. With eq. (|109p . we can see both the term 
dependent on Qn in Fn and the term dependent on h in 
Gn are proportional to dQ 2 g{Q\, Q2), thus the pfaffian is 

proportional to ( dQ 2 g(Qi, Q2)] ■ To have a well-defined 
theory on the reduced phase space, M a b must be every- 
where invertible on the reduced phase space, which is 
means that det M a b 7^ = constant. Hence dQ 2 g(Q\, Q 2 ) 
must also be a constant, so the general form of 



fl(Qi,Oa) = c 1 Qa-ff(Oi), (H4) 



B. Second Step: Induction 



From the first step we know the pfaffian of M a & is prod- 
uct of pi(M AB ) and pi(M pa ), where 1 < A, B < 2N - 2 
and 2N — 1 < p,a < 2N + 2 and the constraint we in- 
troduce in the Lagrangian needs to be the form g{q, q) = 
q—g{q). In this section, we will use induction to complete 
the proof. 



Let us start from N = 3 case, where the Lagrangian 
takes the form 



where g{Qi) is some arbitrary function of Q\. We can 
choose c\ = 1 without loss of generality. 



L = f(q, q, q, q {3} , A 2 ) + A" (q - g(q)), (115) 
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the canonical variables are defined as 



and 



Ai 
A 2 



Pi = 
P 2 = 

P3 = 

Pi = 
P5 = 



df d df d 2 df 



dq dt dq dt 2 dq^ 

df d df 

da dt dqW 
df 

Px, =0 
Px 2 = 0. 



(116) 
(117) 

(118) 

(119) 
(120) 



pf(M j4s ) = d Pl a m (Qi, Pi) = Cm, 



(126) 



The total Hamiltonian as usual is 



h t = P1Q2 + P2Q3 + PMQu Q2, Q3.P3, Qs) 

-/(Qi, O2, Q 3 , h, Q 5 ) - (Q 2 - s(Qi))- (121) 

Since the second class constraints only tell us how to 
express Q 3 , P3, Q5, P5 in terms of Qi, they do not enter 
into the discussion of the linear instability that we are 
interested in (as we have seen in the proof with all second 
class constraints). We thus consider only the first class 
constraints and their conjugate gauge fixing functions. 

After gauge fixing, the constraints are 



$3 
$4 



Pi = 

Q 4 -P(Qi,Pi) «0 

Q 2 -S(Qi)»0 

P 2 -G 2 (Qi,Pi)«0. 



(122) 



With the constraints, the reduced Hamiltonian can be 
written as 

H R = P l g(Qi) + Gi(Q 1 ,P 1 )Q 3 g'(Qi) + J 7 (Qi) 
= g(Qi) (Pi + g'(Qi)G 2 (Qi, Pi)) + HQi), 

(123) 

where T{Q\) can be write explicitly if one substitutes all 
the constraints into the Hamiltonian. On the other hand, 
the pfaffian of Mab where 1 < A, B < 4 is 

pf (01,02,03,04) = -(1 + g'(Qi)d Pl G 2 (Qi, Pi)) 
= -a Pl (Pi+5'(Qi)G 2 (Qi,Pi)). (124) 



By comparing eqs. p23[) . p24p . we find for N = 3, 
if pf(fli, a 2 , 03, 04) = — dp 1 a(Qi, Pi), then the reduced 
Hamiltonian can be written as Hr = g(Qi)cx(Qi, Pi) + 

HQi)- 

Since we require the pfaffian to be a non- vanishing con- 
stant c, we can integrate eq. (|124[) to obtain a(Qi, Pi) = 
— cPi + Q(Qi). The reduced Hamiltonian is thus Hr = 
—cPig(Qi) + T'lQi), which is linearly unstable. 

Now, as usual in induction, we assume for N — m, this 
statement is true, i.e. 



where 1 < A, B, < 2m — 2. We can solve for a m = 
c m Pi + Qm{Qi)- One might notice that there might be 
a ± sign in front of the pfaffian, but the sign will not 
affect the determinant and the linear instability in the 
reduced Hamiltonian. The first class constraints with 
their conjugate gauge fixing functions are 

$1 : P m+ i = 
$ 2 :Q m +i-P(Qi,Pi)«0 
$ 3 :Q 2 -5(Qi)«0 
$ 4 :P 2 ~G 2 (Qi,Pi)«0 



$2k-i:Qk-(O ik ---O i3 )g(Qi)~0 

*a*:A-G fc (Qi,i\)«0 (127) 



where 3 < k < m — 1 and O ik = (2j=i Qi+idQt) is an 
operator. 

We now want to show that for N = m + 1, this state- 
ment is true. In this case, there will be one more first 
class constraint with its gauge fixing function 

$2m-l =Qm~ {O im ■ ■ ■ 6 l3 )g{Ql) ~ 
^2m=P m -G m {Q 1 ,P 1 )^Q. (128) 

The total Hamiltonian and reduced Hamiltonian are thus 
H T = P1Q2 + ■■■ + Pm-lQm + P m Q m +i + ■■■ (129) 



Hr 



g{Qx)oi m {Q\,P\) + G r , 
(d lm+1 ---d l3 ) g (Qi) + 



,,{Qi,Px) x 

J~ m + l(Ql)- 



(130) 



Hr = g{Qi)u m {QuPi) + F m (Qi) 



(125) 



the last • ■ • in eq. (I129[) includes the terms which turn out 
to be functions of Qi onr y on the reduced phase space. 

Now, we want to calculate the pfaffian of the new Mj,b, 
1 < A,B < 2m. One should notice there are two impor- 
tant characteristics of the pfaffian. The first is that all 
the Gi appear in pfaffian in the form of d Pl Gi. Since 
the only non- vanishing elements of the pfaffian are those 
pf (o2i, 02j-i), and since all the <I> 2 fc_i are only depen- 
dent the canonical coordinates Q±, . . . , Qk, thus the non- 
vanishing pfaffian which dependent on Gi takes the form 
dp 1 GidQ 1 &2j-i- The second characteristic is that the 
pfaffian is at most linearly dependent on those d Pl Gi. 
Taking pf (03, 04, 05, a^) as an example, the terms non- 
lincarly dependent on Gi are 

pf(a3,a 4 )pf(a 5 ,a 6 ) D (d Ql $sd Pl G 2 )(d Ql $ 5 d Pl G 3 ) 

(131) 

pf(a 3 ,a 6 )pf(a4,a 5 ) D (dQ 1 <f> 3 dp 1 G 3 )(-dQ 1 <S> 5 dp 1 G 2 )- 

(132) 
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Since the non-linear terms in eqs. (|131l) and (|132[) can- 
cel each other, we conclude that all the nonlinear terms 



Note that we only keep the terms without Gi in the last 
line, which is also only a function of Qi. 

In other words, the second characteristic that the 
pfafhan is at most linearly dependent on those d Pl Gi 
means that in its decomposition as the product 
of the basic element pf (<X2i<jt2j-i) ; in every prod- 
uct pf (a2i 1 a2ji-i)pf(a2i 2 -i a 2j2-i) • ■ • i we cannot use 
(8q 1 , dp 1 ) more than once. Thus when we evaluate 
the last line of eq. (|133[) . we use the modified Pois- 
son bracket [A, B]pi = dQ i Adp i B — 9p 4 A<3(j 4 -B where 
i = 2,3,...,m + 3, and thus the last line is a function of 
canonical coordinates Qi only. 

We can further reduce the pfafhan eq. f|l 33[) by realiz- 
ing that pf(ai, . . . ,a 2m -2) = dp 1 a m (Qx,Px), and hence 



pf(oi, . . . ,a 2m ) = d Pl a m (Q ll P 1 ) 

+d Pl G m x £{Q U Q 2 , . . .,Q m -i) 

(134) 



where £ is a function dependent on Qi, . . . , Q m -x only. 

Requiring that the pfafhan to be a constant Cm+i and 
from the assumption eq. (I126p . we can solve for G m 

dp 1 G m x £(Qi,Q 2 , . . . , Qm-i) = 

. /-i _ ( C m+X — Cm) Pi p (n \ 
=?■ Lrm — £ r Wm+lWlJ 

=* G m = (C? " +1 , C ^\ Pl + G m+ x{Qx). (135) 

In the last equality, we substitute all the constraints to 
rewrite Q 2 , • • ■ , Qm+i as functions of Q\. We can now 
substitute eq. (|135|) and a m back into eq. (|130l) to get 



cannot survive in the pfafhan which now looks like the 
following rather intimidating form 



■ , 0, 2m - 2 ) 

■ ■ , a 2m -2)) ( w hh no Gi). 

(133) 



the reduced Hamiltonian 

H R = ~g{Q l )a m {Q l> P 1 ) + G m {Q u P l ) 

x(<5i m+ i ' ' • OrMQi) + Fm+x{Qx) 

= ~ 9 {Qx)(cmPx + Gm(Qx)) + ^ ~ ^ 
v It, m+X(Wl) 

+J~' \n+x(Qx) 

= Px(c m g(Qi) + {( Z +1 ~Zi )+r' m+ x(Qx), 

v t m+X(Wl) ' 

(136) 

which is linearly dependent on P\ and thus unstable. 
Hence the proof is complete. 

VI. SUMMARY 

We prove in general that the linear instability in a non- 
degenerate higher derivative theory cannot be removed 
by the addition of constraint terms into the theory. Our 
proof is general for any iV-th order non-degenerate higher 
derivative theories with M auxiliary variables. 

Furthermore, the addition of constraints can reduce the 
effective dimensionality of the phase space of the original 
theory. We show that in a general non-degenerate higher 
derivative theory, the last dynamical degree of freedom 
which survives such a reduction is always unstable. 

In the special case of first class constraints, it is clear 
from a physics standpoint that since such constraints can 
be gauged away by the introduction of gauge fixing func- 
tions, first class constraints do not change the physics 
of the original (and unstable) theory. However, the im- 
position of gauge-fixing functions is subtle - one is not 
allowed to freely choose both the gauge-fixing functions 
and the functional form of the constraints in the action 
itself. In both cases, the restriction is due to the fact 
that gauge fixing functions must be chosen such that the 



pf(fll,.. 


•,«2m) = pf(0l,... 


■ , fl2m-2)pf (fl2m 




= pf(dl,... 


■ , a-im-i) + 9Qi 1 




m—1 


fc-2 




+ E* 

k=X 


i=X 



m — X 



k=X 



m-2 



i=X 
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gauge orbits are crossed once and only once. We proved 
that this is a sufficient condition to demonstrate that the 
instability is not removed. 

To conclude, we have demonstrated that the class of 
theories affected by Ostrogradski's instability extend to 
those which possess both first and second class con- 
straints and hence strengthening the power of Ostrograd- 
ski's Theorem. 
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Appendix A: Pfaffians 

Here we demonstrate the relation between determinant 
and pfaffian of 2n x 2n anti-symmetric matrix pij . The 
basic element of a pfaffian is pf(ai, 02) where ai and 02 
are characters. The pfaffians are anti-symmetric func- 
tions with respect to characters, pf (ax, 02) = — pf (02, ai). 
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